Questions:
- explain SGD
- explain SVD
- if A is a
matrix, what's the shape of U, V - do you know any properties of these matrix?
- if A is a
- given a function graph with two local minimum, when GD (with fixed lr) fall into the far one?
- which module (homework) you like the most?
- Given a function, calculate the gradient.
- What is normal distribution? (formula)
- GD with backtracking
- PCA
- Sgd, standardisation, bias column, mse and bce, hw2 ex4
- explain backtracking / armijo
- explain convex function
- explain the output of hw1 exercise 2
- some clarification about HW3
- gradient descent/ SGD / Adam with 1 data point, what changes
- MAP
- PCA, the maths behind it, the code analysis and explaining the PCA homework plots. Also, he asked me about clustering and its relation in PCA alongside its maths.
- GD, SGD and Adam on a 1 sample batch. Eckart Young and PCA, and derive e^(log(x^2))
- SVD, Matrix approximation, Young Theorem and 4.2
- hw2 and 4, 2 in general, then explain SVD the formula, why do we do the centroid